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Abstract. A classical question in combinatorics is the following: given a par- 
tial latin square P, when can we complete P to a latin square L? In this 
paper, we investigate the class of < e-dense partial latin squares: partial 
latin squares in which each symbol, row, and column contains < era- many non- 
blank cells. Based on a conjecture of Nash-Williams, Daykin and Haggkvist 
conjectured that all < j-dense partial latin squares are completable. In this 
paper, we will discuss the proof methods and results used in previous attempts 
to resolve this conjecture, introduce a novel technique derived from a paper 
by Jacobson and Matthews on generating random latin squares, and use this 
technique to study <e-dense partial latin squares that contain < 6n 2 cells. 
In this paper, we construct completions for all <e-dense partial latin squares 

containing < <5n 2 filled cells in total, given that e < , 8 < ^ 9963^ ■ in P ar_ 
ticular, we show that all < 10 — 4 -dense partial Latin squares are completable. 
Finally, using probabilistic methods, we are able to improve these results by 
roughly a factor of two for many values of e, <5, and n. 

These results improve prior work by Gustavsson, which required e = S < 
10 -7 , as well as Chetwynd and Haggkvist, which required e = S = 10 -5 and 
n even, < 10 7 . 



1. Introduction/History 

A latin square of order n is a n x n matrix populated by the symbols {1, . . . n}, 
in which each symbol occurs at most once in every row and column; analogously, 
a partial latin square is such a n x n matrix in which we also allow some of these 
cells to be blank. We say that a n x n partial latin square is completable if there 
is some way to fill in all of its blank cells to get a latin square. This is sometimes, 
but not always possible, as the following two examples illustrate. 
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has no completion. 



Determining the classes of partial latin squares that do admit completions is a 
classic genre of problems in combinatorics. To illustrate some of these questions, we 
list a few classes of partial latin squares for which we have resolved this question. 

• Any latin rectangle (i.e. any partial latin square P where the first k rows 
of P are completely filled, while the rest are blank) can be completed (Hall 

• If P is a partial latin square all of whose nonblank entries lie within some 
set of s rows and t columns, and s + t < n, P can be completed (Ryser [5]). 
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• If P is a partial latin square with < n — 1 filled cells, P can be completed 
(Smetianuk [5]). 

• If P is a n x n partial latin square of order > 6 such that precisely two rows 
and two columns of P are filled, P can be completed (Buchanan pQ). 

In this paper, we will examine the class of < e-dense partial latin squares: 
n x n partial latin squares in which each symbol, row, and column contains < en- 
many nonblank cells. This class of partial latin squares was first introduced in a 
paper by Daykin and Haggkvist [3], where they made the following conjecture. 

Conjecture 1.1. Any <\-dense partial latin square can be completed. 

Gustavsson [4] noted in his thesis (completed under Haggkvist) that this bound 
of j was anticipated somewhat by a conjecture of Nash- Williams [7] on triangle 
decompositions of graphs. 

Conjecture 1.2. Suppose that G = (V, E) is a finite simple graph where each vertex 
has even degree, \E\ is a multiple of 3, and every vertex ofG has degree > |n. Then 
we can divide the edge set of G into \E\/Z distinct edge-disjoint triangles. 

These two conjectures are linked via the following natural bijection between 
partial latin squares and triangulations of tripartite graphs, illustrated below. 

rows 



In particular, given any < e-dense partial latin square P, the above transformation 
allows us to transform the partial latin square into a triangulated tripartite graph in 
which no vertex has more than en-many neighbors in any one given part. Therefore, 
triangulating the tripartite complement of this graph corresponds to completing P 
to a complete latin square; in this sense, the conjectured Nash- Williams degree 
bound of \n suggests the Daykin-Haggkvist conjecture that all < ^-dense partial 
latin squares are completable. 

It bears noting that this bound of \ is tight; for any c > 0, there are known 
< (4 + c) -dense partial latin squares (see Wanless [10]) that cannot be completed. 

In the 1984 paper where Daykin and Haggkvist formed this conjecture, they also 
proved the following weaker form of their claim. 

Theorem 1.3. All < 2g ]^ -dense partial latin squares are completable, whenever 
n = mod 16. 

This was strengthened in 1990, by Chetwynd and Haggkvist [2], to the following 
theorem. 

Theorem 1.4. All <10~ 5 -dense partial latin squares are completable, for n even 
and > 10 7 . 

Using Chetwynd and Haggkvist 's result along with the above tripartite graph- 
partial latin square connection, Gustavsson extended this result further in his afore- 
mentioned thesis to all values of n, in exchange for a slightly worse bound on e. 




symbols 
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Theorem 1.5. All <10 -dense partial latin squares are completable. 



The proofs in Chetwynd and Haggkvist's paper are difficult in parts, but the 
key idea behind the entire paper, trades on latin squares, is rather simple and 
elegant. We define these trades below. 

Definition 1.6. A trade is a pair of partial latin squares (P, Q) that satisfy the 
following two properties: 

• A cell (i,j) is filled in P if and only if it is filled in Q. 

• Any row or column in P contains the same symbols as that same row or 
column in Q. 

For example, the following pair of partial latin squares form a trade. 



(P,Q) = 
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In particular, Chetwynd and Haggkvist repeatedly use trades as a way to perform 
small, local modifications on rather large latin squares. This is done as follows: 
suppose that (P, Q) is a trade and L is a completion of P. Look at the array M 
formed by taking L and replacing all of P's cells in L with those of Q; by definition, 
this new array is still latin. 
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Trades of the above form, that consist of a pair of 2 x 2 subsquares, are particularly 
useful because they are the simplest trades that exist. We call these trades 2x2 
trades throughout this paper, and make frequent use of them. 

Using trades, a rough outline of Chetwynd and Haggkvist's paper can be thought 
of as the following: 

Construct a n x n latin square L in which every cell is involved in lots of 
well-understood 2x2 trades. 

For every filled cell (i,j) in P, use this structure on L to find a "simple" 
trade on L such that performing this trade causes L and P to agree at 
(i, j). ("Simple" here means that it should not be too difficult to find using 
our given structure, nor should this trade disturb the contents of too many 
cells in L.) 

If such trades can be found for every cell (i, j), such that none of these 
trades overlap (i.e. no cell is involved in more than one trade), then it is 
possible to apply all of these trades simultaneously to L. Doing this results 
in a latin square that agrees with P at every filled cell of P. 

As mentioned before, these methods work for < lCP 5 -dense partial latin squares, 
when n is even and > 10 7 ; however, they do not seem to work on denser partial latin 
squares. The main reason for this is that using small trades creates very strong 
local and global constraints on our partial latin square P and our constructed 
latin square L, in the following ways: 



(1) 
(2) 

(3) 
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• Iii general, a n x n latin square L where every cell is involved in many 
small well-understood trades does not always seem to exist. In particular, 
the Chetwynd and Haggkvist paper relies on the existence of latin squares 
L where every cell is involved in n/2 distinct 2x2 trades: however, these 
squares appear to only exist in the case that n is even. (More generally, for 
a fixed constant c, latin squares L where every cell is involved in n/c many 
2x2 trades appear to be difficult to find or construct or find whenever n 
is odd.) 

• Moreover, if we want to follow their blueprint, we will need to find a large 
collection of disjoint trades on L; namely, one for every filled cell in P. In 
doing this, we need to ensure that no given row (or column, or symbol) gets 
used "too often" in our trades; otherwise, it is possible that we will need 
to use that row at a later date to fix some other cell in P, and we will be 
unable to find a nonoverlapping trade. This is a strong local constraint, 
as it requires us to reserve in every row/ column/symbol a large swath of 
"available" cells which we have not disturbed, so that we can use their 
structure to construct future trades. This also forces us to do a lot of 
normalization work before and during the search for these trades, in order 
to preserve this structure. (This is the "difficult" part of Chetwynd and 
Haggkvist 's proof, which otherwise is as straightforward and elegant as our 
earlier outline suggests.) 

• Finally, we also have a large amount of global constraints that we are 
running into. In order to find any of these trades, we need to preserve a 
large amount of structure in L. However, using this structure means that 
we need to ensure that most of L still looks like the well-structured square 
we started with: consequently, each trade requires much more structure 
than just the cells it locally disturbs. 

Given the above issues, it may seem like the technique of using trades to complete 
partial latin squares is a dead-end. However, we can overcome many of these 
restrictions by using the concept of improper latin squares and trades (introduced 
by Jacobson and Matthews [5] in a 1996 paper on generating random latin squares.) 
We define these objects below. 

Definition 1.7. A improper latin square L is a n x n array, each cell of which 
contains a nonempty signed subset of the symbols {1, . . . n}, such that the signed 
sum of any symbol across any row or column is 1 . 
A quick example: 
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Analogously, we can define an partial improper latin square as a n x n array, 
each cell of which contains a nonempty signed subset of the symbols {1, . . . n}, such 
that the signed sum of any symbol across any row or column is either or 1, and an 
improper trade as simply a pair of partial improper latin squares that share the 
same set of filled cells and the same signed symbol sums across any row or column. 
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Essentially, improper latin squares exist so that the following kinds of things can 
be considered trades. 



P = 
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Call these trades improper 2x2 trades: in practice, these will be the only 
improper trades that we need to use. 

The main use of these improper 2x2 trades is that they let us ignore the "local" 
constraints described earlier: because we do not need a cell to be involved in a 
proper 2x2 trade in order to manipulate it, L does not require any local 2x2 
structure. In particular, this lets us use latin squares L of odd order, as it is not 
difficult to construct a latin square L of odd order with a large global number of 
2x2 subsquares (even though some cells will not involved in any 2x2 trades.) We 
will still have the global constraints noticed earlier; in general, any system that uses 
only a few pre-defined types of trades seems like it will need to have some global 
structure to guarantee that those trades will exist. However, just removing these 
local constraints gives us several advantages. 

• Using improper trades, we can complete all partial latin squares that are 
— jifi'dense, an improvement from the < 1CP 7 of Gustavsson and the 
< 10~ 5 ,n even of Chetwynd and Haggkvist. Using some additional prob- 
abilistic machinery, we can complete all n x n < 6 q 00 -dense partial Latin 
squares, if n > 25 q Q0 . 

• More interestingly, because we have removed these local constraints, we 
can now talk about completing < e-dense partial latin squares that globally 
contain < <5n 2 -many filled cells, where e and 5 may not be equal. In other 
words, we can now differentiate between our global and local constraints; 
this allows us to (in particular) massively improve our local bound e at the 
expense of our global bound 5. For example, we can use improper trades 
to complete any < -jj-dense partial latin square, provided that it globally 
contains < 2 • 10 -6 • n 2 -many filled cells. 



we can show 



• In fact, given any e <G [0, j^), and any value of S < jjjjgg 
that any < e-dense partial latin square P containing < Sn 2 filled cells in 
total is completable. 

• Furthermore, because we have removed these local constraints, we can elim- 
inate a lot of the "normalization" processes and techniques that Chetwynd 
and Haggkvist needed for their trades: consequently, these proof methods 
are (in some senses) easier to understand. 

The following process outlines how we will construct a completion of any such 
< e-dense partial latin square P containing < 5n 2 -many filled cells: 

(1) First, we will create a latin square L that globally contains a large number 
of 2 x 2 subsquares. 

(2) Then, we will show that in any fixed row or column, it is possible to ex- 
change the contents of "almost any" two cells using simple trades, provided 
that we have not disturbed too much of L's global structure. 
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(3) Using the above claim, we will show that given any filled cell (i,j) in P, 
there is a trade that causes L and P to agree at this filled cell, and that 
does not disturb any cells at which P and L previously agreed. 

(4) By repeated applications of (3), we will turn L into a completion of P, 
which is what we want. 

With our goals clearly stated and our techniques described, all that remains for 
us is to explicitly prove the above claims. 

2. The Proof 

We begin by creating latin squares with "lots" of well-understood 2x2 sub- 
squares. 

Lemma 2.1. For any n, there is a n x n latin square L that can be thought of as 
having the following structure: 

• Initially, every cell in L is involved in at least \ n/2\ -many 2x2 subsquares. 

• However, the contents of up to 3n + 7 cells in L may have been disturbed, 
and thus are cells at which this property does not hold. 

Proof. For even values of n, we can simply use the following construction used by 
Chetwynd and Haggkvist in their proof. 
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In general, their construction is the following: if we set A as the k x k circulant 
matrix on symbols {1, . . . k} and B as the k x k circulant matrix on symbols {k + 



1, . . - 2k}, we can define L as the n x n latin square given by 



A 


B 


B T 


A 1 ' 



An 



example for n — 8 is provided above. 

This latin square L, as noted by Chetwynd and Haggkvist, has the following 
property: every cell in L is involved in precisely n/2 distinct 2x2 subsquares. To 
see this, notice that another way to describe L is as the following. 

j — i + 1 mod k for i,j < k, 

i — j + 1 mod k for i > k, j > k, 

(j — i + 1 mod k) + k for i < k, j > k, 

(i — j + 1 mod k) + k for i > k, j < k, 

Take any cell in our latin square L. Pick any other cell (i,y) from the same 
row as (i, j), but from the opposite quadrant. Pick the cell (x,j) so that it has the 
same symbol as the symbol in (i,y): then we have that 

y — i + 1 = x — j + 1 mod k. 

This implies that 

y — x + 1 = i — j + I mod k; 
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i.e that the symbols in cells (i,j) and (x,y) are the same. Therefore, any cell (i,j) 
in our latin square L is involved in precisely n/2-many 2x2 subsquares: one for 
every cell in the same row and opposite quadrant. 

For n — 4k + 1 for some k, we can augment Chetwynd and Haggkvist's con- 
struction as follows. First, use the construction above to construct a 4fc x 4fc latin 
square L. Now, consider the transversal of L consisting of the following cells. 

(l,l),(2,3),(3,5),(4,7)...(fc,2fc-l), 
(k + 1, 2k + 1), (fc + 2, 2k + 3), . . . (2k, 4k - 1), 
{2k + 1, 2k + 2), (2k + 2, 2k + 4), . . . (3k, 4k) 
(3fc + l,2),(3fc + 2,4),...(4fc,2fc). 
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(The above transversal in a 13 X 13 latin square.) 



Using this transversal, turn L into a 4k + 1 x 4k + 1 latin square L' via the 
following construction: take L, and augment it by adding a new blank row and 
column. Fill each cell in this blank row with the corresponding element of our 
transversal that lies in the same column as it; similarly, fill each cell in this blank 
column with the corresponding transversal cell that is in the same row. Finally, 
replace the symbols in every cell in our transversal (as well as the blank cell at the 
intersection of our new row and column) with the symbol 4k + 1. This creates a 
n x n latin square, such that all but 3n — 2 cells are involved in precisely (n/2) — 2 
distinct 2x2 subsquares. 

For n — 4k — 1, things are slightly more difficult. While we can use our earlier 
construction to create a 4k — 2 x 4k — 2 latin square, the resulting square does not 
have a transversal. However, we can use our 2x2 substructure to slightly modify 
this square so that it will have a transversal, and then proceed as before. We outline 
the process for creating this transversal below: 

• First, use the Chetwynd and Haggkvist construction to create a (4k— 2) x 
(4k — 2) latin square L. 

• In our discussion earlier, we noted that for any pair of cells (i,k) in 
the same row but from different quadrants, there is a 2 x 2 subsquare that 
contains those two elements. Take the 2x2 subsquare corresponding to the 
cells containing 2 and 4k — 2 in the last row, and perform the 2x2 trade 
corresponding to this subsquare. 

• Similarly, take the 2x2 subsquare corresponding to the cells containing 
2k — 1 and 4k — 2 in the far-right column, and perform the 2x2 trade 
corresponding to this subsquare. 



8 



PADRAIC BARTLETT 



• With these two trades completed, look at the four cells determined by the 
last two rows and columns of our latin square. They now form a 2 x 2 



subsquare of the form 
to this 2x2 subsquare 
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4k -2 


4fc-2 


1 



Perform the trade corresponding 



Once this is done, we can find a transversal by simply taking the cells 

(l,l),(2,3),(3,5),(4,7)...(fc,2fc-l), 
(k + 1, 2k), (k + 2, 2k + 2), . . . (2k - 1, 4fc - 4), 
(2k, 2k + 1), (2k + 2, 2k + 3), . . . (3k -2,4k -3), 
{3k -1,2), (3k, 4), . . . (4k -3,2k- 2), 
(4k -2,4k -2). 
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(The above transversal in a 14 X 14 latin square.) 

Using this transversal, we can extend our latin square to a 4fc — 1 x 4fc — 1 latin 
square, using the same methods as in the n = 4k + 1 case. This completes the first 
step of our outline. □ 

Our next lemma, roughly speaking, claims the following: given any latin square 
generated by Lemma |2.1| we can pick any row and exchange the contents of "many" 
pairs of cells within that row, without disturbing other cells in that row or more 
than 16 cells in our entire latin square. Furthermore, the following result claims 
that we can do this repeatedly: i.e. that we can apply this result not just to latin 
squares generated by Lemma |2.1[ but to latin squares generated by Lemma |2.1| that 
have had the contents of up to kn 2 many cells disturbed by such trades, for some 
constant k that we will determine later. We state this claim formally below. 

Lemma 2.2. Initially, let L be one of the nxn latin squares constructed by Lemma 
|2.1[ and P be an < e-dense partial latin square. Perform some sequence of trades 
on L, and suppose that after these trades are completed that the following holds: no 
more than kn 2 of L 's cells either have had their contents altered via such trades, or 



were part of the < 3n + 7 cells that were disturbed in the execution of Lemma 2.1 
Fix any positive constant d > 0. Then, for any row r\ of L and all but 

• 2|n + en choices of C\, and 

• 4|n + 2dn + 3en + 1 choices of ci, 
there is a trade on L that 

• does not change any cells on which P and L currently agree, 
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• changes the contents of at most 16 cells of L, and 

• swaps the symbols in the cells (ri,ci) and (r 2 ,c 2 ), 
as long as the following two equations hold: 

• 3 < n — 4|n — 6dn — 6en, and 

• 12 < n - Yldn - I2en. 



kn-* 
dn 



Proof. Call a row, column, or symbol in L d-overloaded (or just overloaded, for 
short) if > dn of the entries in this row/column/symbol have had their contents 
changed by the trades we have performed thus far on L, counting the < 3n + 7 cells 
from L 's construction as such changed cells. Note that no more than 
rows, columns, or symbols are overloaded. 

Intuitively, overloaded rows are going to be "difficult" to work with: because 
most of the structure in our latin square no longer exists within that row, we will 
have relatively few ways to reliably manipulate the cells in that row. Conversely, if 
some row is not overloaded, we know that the contents of most of the cells within 
this row have not been disturbed; in theory, this will make manipulating this row 
much easier, as we will have access to a lot of the structure we have built into our 
latin square L. (Similar comments apply to overloaded symbols and columns.) 

With these comments as our motivation, we begin constructing our trade. Fix 
some row r\\ we want to show that for most pairs of cells within this row, there 
is a trade which exchanges their contents without disturbing many other cells in 
L. Naively, we might hope that for most pairs of cells in our row, we can find the 
following trade: 





If this situation occurs, we can simply perform the 2x2 trade illustrated above to 
swap the two cells containing s\ and s 2 . The issue, however, is that this situation 
may never come up: if n is an overloaded row, for example, it is entirely possible 
that none of its elements are involved in any 2x2 subsquares. To fix this, we use 
the technique of improper trades: specifically, we will choose some other row r 2 , 
and perform the improper trade 




This accomplishes our original goal of exchanging these two elements in n ; how- 
ever, we may now have an improper latin square, if either S3 7^ s 2 or S4 ^ s\. The 
aim of this lemma is to construct a proper trade on our latin square: therefore, 
we need a way to augment this trade so that it becomes a proper one. This is not 
too difficult to do: by repeatedly stringing together improper 2x2 trades that use 
nonoverloaded rows/columns/symbols wherever possible, and using cells that have 
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not been disturbed by earlier trades where possible, we can augment the improper 
trade above to one of two possible proper trades. We illustrate the first of these 
below: 



C 3 C x C 2 C 4 



C 3 Ci C 2 C 4 



S 2 



S 3 



Sq 



s 2 



S 4 



Si — Sr> 



s 4 - s 2 



r 2 



s 4 - s x 

s 2 



s 4 



s 2 _ s 3 



r 2 



The simpler trade of the two, illustrated above, occurs in the following situation. 
Look at the two cells (n, Ci) and (n, C2), and suppose that the symbols in both of 
these cells are not overloaded. For each of these two cells, there are two possibilities: 
either the cell (ri,c,) is still in the same quadrant that the symbol Si started in, 
or it has been permuted to the other quadrant. The trade we have just drawn 
above occurs in the situation where either both of these cells are still in the same 
quadrants that their corresponding symbols started in, or when neither of these 
cells are in the same quadrants that their corresponding symbols start in. (This 
condition is equivalent to asking that rows r 3 ,r4 are both in the same half of our 
square, which is necessary for our choice of r 2 .) 

We show that this trade can always be found using the following heuristic: we 
will choose the rows/columns/symbols involved in this trade one by one, choosing 
each so that as many of the variables determined by our choice are involved in 
nonoverloaded rows/columns/symbols as possible. Furthermore, we will also at- 
tempt to insure that as many cells as possible in our trade have never had their 
contents disturbed by either earlier trades on L or from being part of the potential 
3n + 7 disturbed cells in L's construction. Finally, we will also make sure that our 
choices never involve cells where L and P currently agree. 

Start by choosing si such that the following properties hold: 

• The symbol si is not overloaded. As well, if c\ is the column such that 
(ri,ci) contains si, the column c\ should also not be overloaded. This 
eliminates at most 2|n choices. 

• The cell (r\ , c\) is not one at which P and L currently agree. This eliminates 
at most en more choices. 

Therefore, we have 

n — 2—n — en 
a 

choices for si, as claimed. 

We choose the second symbol s 2 so that a similar set of properties hold: 

• The symbol s 2 should not be the same as s\. This eliminates one choice. 

• The cell in column c\ containing s 2 has not been used in any earlier trades; 
as well, the cell containing the symbol Si in the column c 2 has not been 
used by any earlier trades. Because neither C\ nor S\ are overloaded, we 



COMPLETIONS OF <e-DENSE PARTIAL LATIN SQUARES 



11 



know that at most dn entries in either of these objects have been used in 
previous trades. Therefore, this restriction eliminates at most 2dn choices. 
The rows r 3 ,r 4 containing these two undisturbed cells are not overloaded. 
As well, we ask that neither the symbol s 2 nor the column c 2 is overloaded. 
This eliminates at most 4§n choices. 

Neither the cell (ri, c 2 ) nor the two undisturbed cells are cells at which P 
and L currently agree. This eliminates at most 3en more choices. 



This leaves 



k 

4-n - 2dn - Sen - 1 
d 



choices for s 2 , again as claimed. 

We have one final choice to make here: the row r 2 . Observe that because the 
cells (n, Cj) are either both in the same quadrant that the cell containing Si in row 
r\ started in, or both permuted to the quadrants they were not in, the cells (r 3 , c\ ) 
and (r 4 , c 2 ) are both either in the top half or both in the bottom half of our latin 
square. Using this observation, we can choose r 2 so that the following conditions 
hold: 

• The row r 2 is in the opposite half from the rows r 3 , r 4 , and is not r\. This 
eliminates at most |~n/2] + 1 choices. 

• None of the cells (r 2 ,ci), (r 2 ,c 2 ), (r 2 ,c 3 ), (r 2 ,c 4 ), (r 3 ,c 4 ), or (r 4 ,c 3 ) have 
been used in prior trades. Because neither the columns Ci, c 2 , nor the 
symbols Si, s 2 , nor the rows r 3 , r 4 are overloaded, this restriction eliminates 
at most 6dn choices. 

• None of these cells are cells at which P and L currently agree. This elimi- 
nates at most 6en choices. 

This leaves 



6dn — 6en — 1 



choices for the row r 2 . 

Notice that because r 2 has been chosen to be from the opposite half of r 3 , r 4 , and 
none of these cells nor the earlier have been disturbed by earlier trades, we know 
that the symbol s 3 is in the cell (r 3 , c 4 ) and the symbol s 4 is in the cell (r 4 , c 3 ): this 
is because of L's previously-discussed "many 2 x 2 subsquares" structure. Therefore, 
whenever we can make all of these choices, we have constructed the trade that we 
claimed was possible. 

The slightly more complex trade that we have to consider is when the above 
choice of r 2 is impossible: i.e. when one of the rows r 3 , r 4 is in the top half and the 
other is in the bottom half of L. We deal with this obstruction via the following 
trade, which (again) was constructed by repeatedly applying improper 2x2 trades. 
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C 3 Ci C 4 C 5 C 2 C 6 



s 6 



S2-S5 



s 3 Hs 9 



s 4 - s 6 



C 3 Ci c 4 c 5 c 2 c 6 



s 6 _ s 2 



s 4 



Ss 



Choose si, s 2 exactly as before. We will now choose values of r 2 and s 6 so that 
the trade illustrated above exists. 

We start out with n possible choices of r 2 : for any such choice, either r 2 and r 3 
will be in different halves, or r 2 and r 4 will be in different halves of our latin square. 
Assume without loss of generality that r 2 and r 4 are in different halves of L: the 
case where r 2 and r 3 are in different halves is identical. In this case, pick r 2 such 
that the following properties hold: 

• None of the cells (r 2 ,ci), (r 2 ,c 2 ), (r 2 ,c 3 ), (r 2 ,c 4 ), (r 4 ,c 3 ), or (r 3 ,c 4 ) have 
been disturbed in prior trades. Because neither the columns c\, c 2 , nor the 
symbols S\, s 2 , nor the rows r 3 , r 4 are overloaded, this restriction eliminates 
at most 6dn choices. (Notice that because r 2 and r 4 are in different halves, 
we know that the same symbol s 4 is in (r 4 ,c 3 ) and (r 2 ,c 2 ); again, this is 
caused by L's well-understood "many 2x2 subsquares" structure. However, 
unlike our earlier case, we cannot make a similar assumption for the cell 

(>3,C 4 ).) 

• None of the following are overloaded: the symbol s 3 , the symbol S5, the row 
r 2 , or the column c 4 . Furthermore, none of the cells determined by these 
choices are in use in our trade thus far. This eliminates at most A^n + 2 
choices. 

• None of these cells are cells at which P and L currently agree. This elimi- 
nates at most 6en choices. 

This leaves at least 

k 

n — 4-n — 6dn — 6en — 2 
a 

choices. 

I A I B I 

Before making our final choice, notice that in the original | | — form of 



A 


B 


B T 


A 1 ' 



our latin square L, the symbols s 3 and S5 have to originally have came from the 
same quadrant. This is because none of the cells (ci,r 2 ), (c 4 ,r 3 ), (ci,r 3 ), (c 4 ,r 2 ) 
have been disturbed in prior trades, and the two cells (c 2 ,r 4 ), (c 3 ,r 2 ) contain the 
same symbol s 2 . (In the case where r 2 and r 3 are in opposite quadrants, a similar 
argument can be made about the symbols s 4 and s 5 .) 

Using this observation, choose s@ so that the following properties hold: 
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• Choose sq so that it is in the opposite half from the symbols S3,S5- This 
eliminates at most [n/2] choices. 

• None of the cells containing s§ in columns c±, C4 or rows r 2 , r^, nor the cell 
containing S3 in column eg, nor the cell containing 35 in column C5, have 
been used in previous trades. Because the columns ci,C4, rows r 2l r^, and 
symbols S3,ss are all not overloaded, this is possible, and eliminates at 
most Qdn choices. 

• None of these cells are places where P and L agree. This eliminates at most 
6en choices. 

• The symbol s§ has not been chosen before. This eliminates at most five 
choices. 

This leaves at least 

\n/2\ — Qdn — 6en — 5 

choices. 

Using our "many 2x2 subsquares" structure tells us that we have constructed 
the claimed trade. Therefore, as long as we can make these choices, we can find 
one of these two trades. By looking at all of the choices we make during our proof 
and choosing the potentially strictest bounds (under certain choices of d,e,k,n,) 
we can see that such trades will exist as long as 

k 

3 < n — 4— n — Qdn — 6en, and 
d 

12 < n - \2dn - 12en. 



□ 

Note that an analogous result holds for exchanging the contents of almost any 
two cells in a given column, using the exact same proof methods. 

The next lemma, built off of Lemma |2.2[ is the main tool used in this paper. 



Lemma 2.3. As before, let L be one of the n x n latin squares constructed by 



Lemma 2.1, and P be an < e-dense partial latin square. Suppose that we have 
performed a series of trades on L ( counting the < 3n + 7 trades that were possibly 
performed in the construction of L)that have changed the contents of no more than 
kn 2 of L 's cells. 

Fix any cell (ri,ci) such that P(ri,c\) is filled and does not equal L(r\,ci). 
Then, there is a trade on L that 

• does not change any cells on which P and L currently agree, 

• changes the contents of at most 69 other cells of L, and 

• causes P and L to agree at the cell {r\,c\), 
whenever we satisfy the bound 

22 < n - 12nVk - 12en. 



Proof. Let d > be some constant, corresponding to the notion of an "overloaded" 
row we introduced earlier. 

Suppose that L(ri,ci) = s x 7^ P(r l7 ci) = s 2 . Let r 2 be a row and c 2 be a 
column such that L(r 2 ,Ci) — L(r ll c 2 ) = s 2 . 
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Our goal in this lemma is to construct a trade that causes L and P to agree at 
(j"i,ci), without disturbing any cells at which P and L already agree. We will do 
this using four successive applications of Lemma |2.2| one each on row n, row r§, 
column ci, and column C3, as illustrated in the picture below: 



r 2 - 



I'4 



2 93 C A 
S 2 - ? = S3 

?Ns7 

s 9 u ?~tfs7] \ 

VA^I \— 

X ^ 

3 I 

= denotes "exchange contents 
of these two cells via Lemma 2.2" 



9i 92 93 94 



s 2 - s 4 . 

Pn Pn 

s 4 - s 2 • 



r 2 - 



9i 92 93 94 



s 4 - s 2 ■ 



I 



S 2 - S 3 ■ 



Assuming we apply Lemma |2.2| as claimed, performing the subsequent trade illus- 
trated in the diagram causes L and P to agree at the cell (ri,ci). Therefore, it 
suffices to show how we can use Lemma 2.2 as illustrated above. First, notice that 



picking the cell (r 4 , c 4 ) that contains s 4 determines the rows r 2 , r 4 and the columns 
c 2 , c 4 . Because of this, we want to choose this cell such that the following properties 
hold: 

• The cells (r 4 ,c 4 ), (r 4 ,c 2 ), (r 2 ,c 4 ) are not ones at which P and L currently 
agree. This eliminates at most Zen choices 

• The four cells (r 4 , c 4 ), (r 2 , C3), (7-3, c 2 ), (t*i, c 4 ) are all valid choices for the 



first cell to be exchanged in an application of Lemma [272] Via Lemma 2.2 

/ k+ 48 \ — — 

this eliminates at most 4 I 2 — ^-n + en j choices. (The +^| comes from the 

fact that we are applying Lemma 2.2 four consecutive times, and therefore 
on the fourth application of our lemma our latin square L may contain up 
to kn 2 + 48 disturbed cells.) Also, notice that Lemma '. 
cells are not ones at which P and L agree. 
None of these cells collide with the cells (r 4 , c 4 ), (r 4 , c 2 ), (r 2 , c\). This elim- 
inates at most three more choices. 



2.2 



insures that these 



This leaves us with 



n- 8- 



4S 

n 2 



7en — 3 



choices for this cell. 

Now, choose the symbol S3 such that the following properties hold: 

• The cells containing S3 in row r 4 and column C3 are both valid choices 
for the seco nd cell to be exchanged in an application of Lemma |2.2[ By 



Lemma 



2.2 



this eliminates at most 2 I 4 



fc+- 



2dn + Sen + 1 ) choices. 



Note that m this calculation we have already ensured that these cells are 
not ones at which P and L agree. 
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• None of these cells collide with our other cells. This eliminates at most four 
other choices. 

This leaves us with 

n — 8 — n — Adn — Gen — 6 

a 

choices of this symbol. By an identical chain of reasoning, we have precisely the 
same number of choices for S4. 

Therefore, if we can make the above pair of choices and additionally satisfy the 
bounds 

3 < n — 4 t^- n — 6dn — 6en, 

d 

12 < n - Yldn - Ylen 



required by Lemma |2.2| we can exchange the contents of the cells (ri, C4) and (r^, C2) 
with the contents of the two indicated cells containing s 3 , and similarly exchange 
the contents of (r^,c 2 ), (r!,c 4 ) with the two S4-cells. (Note that because of the ^| 
term above, we have already insured via Lemma [2~2] that none of these applications 
overlap.) 

Because each application of Lemma 2.2 disturbs the contents of at most 16 cells, 
and our final trade disturbs 5 other cells apart from (ri,ci), we have constructed 
a trade that disturbs no more than 69 cells other than (r 1; ci) whenever we can 
satisfy these five inequalities. 

Using basic calculus, it is not too difficult to see that the best choice of d for 
maximizing the values of e, k available to us is roughly \fk. Therefore, if we let 
d = Vk, we can (by comparing our five inequalities) reduce the number of bounds 
we need to consider down to just one: specifically, 

12 < n - 12nVk - 12en. 



□ 

With these lemmas established, we can now prove the central claim of this paper. 

Theorem 2.4. Any <e-dense partial latin square P containing < Sn 2 filled cells 
in total is completable, for e < j^, 8 < Qgff ■ 

Proof. Take any <e-dense partial latin square P with <5n 2 -many filled cells, and let 



L be a latin square of the same dimension as P as generated by Lemma [2. 1| Cell 
by cell, select a filled cell (ri,ci) of P at which P{r\,C\) 7^ L(ri,ci), and apply 
Lemma T2.3I to find a trade that disturbs at most 69 other cells and that causes L 
and P to agree at this cell. Again, if we can always apply this lemma, iterating 
this process will convert L into a completion of P, and thus prove our theorem. 
So: we start with a square in which at most 3n + 7 cells were disturbed, and 



proceed to disturb 69(5n more cells via our repeated applications of Lemma 2.3 If 



we want this to be possible, we merely need to choose n, S, e such that the inequality 



12 < n - 12\/69(5n 2 + 3n + 7 - 12en 
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holds. For any e < jk, we can choose any value of 

(l-12 £ ) 2 -f(l-12 £ ) + ^ 



6 < 



9936 



and our inequality will hold. For — < S, our latin square contains 5n 2 < n symbols, 
and is therefore completable via a result of Smetianuk; so we can assume that 
i > S, and thus that we can simplify our inequality to 

(1- 12e) 2 



S < 



9963 



□ 



Corollary 2.5. Any j^-dense partial latin square containing < 2 ■ 10 6 filled cells 
is completable. 

Corollary 2.6. All < 10~ 4 -dense partial latin squares are completable. 



Proof. The first corollary is immediate from Theorem |2.4| For the second: if we 
set e = 5, we are simply dealing with a < e-dense partial latin square. Whenever 
5 = e < , a < e-dense partial latin square is a latin square with no more than 1 
entry in any row, column, and symbol, and is completable (either exactly one entry 
is used in every row, column, and symbol, in which case this is a transversal and is 
easily completable; otherwise, we have a latin square with less filled cells than its 
order, which we know is completable due to a result of Smetia nuk.) Otherwise, if 
5 = e > ^, we can use the first, longer inequality in Theorem 
< 10~ 4 -dcnse partial latin squares are completable. 



2.4 



to see that all 
□ 



With some probabilistic arguments, we can improve upon these results. In par- 
ticular: given a latin square L created by Lemma |2.1| notice that each application 



of Lemma 2.3 disturbs up to 69 cells in L for each cell (ri,ci) that it makes L 
and P agree at. When L has most of its structure, this is a large overestimate: in 
many cases, we would hope to find cells (rj., ci) in P such that one 2x2 trade on 
L will make L and P agree at this cell. So: for a given cell in P, what possible 
obstructions could there be to a 2 x 2 trade existing on L that causes P and L to 
agree at this cell? 

First, it is possible that none of L's 2x2 trades can cause L and P to agree at 
this cell. Given any cell of P, there are at most |~n/2] +2 many choices of symbol 
for this cell for which no such 2x2 trades exist. 

Second: even if these 2x2 trades exist, it is possible that they "conflict" with 
each other: i.e. that some of our 2x2 trades share cells in common. In how many 
ways can this happen? Well: suppose that (ri,ci) is a filled cell in P containing 
the symbol s\. If a 2 x 2 trade exists on L that makes L and P agree at this cell, 
it is necessarily of the following form: 



X 



Call the cell (r 1; Cx) in L the overlap-cell, the cell (ri, c-i) in L the row-dependent 
cell, the cell (r 2 , Ci) in L the column-dependent cell, and the cell at (r 2 , c 2 ) in L 
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the symbol-dependent cell. If a cell in L is used in two different 2x2 subsquares, 
it occurs in each of those two squares as one of these four different possible "types" 
of cells. There are, a priori, 

(1) + (2) = 10 P° ssible 

ways in which this can happen: 
i.e. a cell could be both an overlap cell for one 2x2 trade and a row-dependent 
cell for another trade, or perhaps a symbol-dependent cell for two different 2x2 
trades. 

However, we might hope that the above difficulties only arise in pathological 
cases. In particular, given a partial latin square P, suppose that we select and 
apply random permutations to P's columns, rows, and symbols. We might hope 
that for the most part, "half" of the cells of P have potential 2x2 trades available, 
and that "most" of the potential overlap conflicts identified above don't occur. Wc 
study this in the following lemma: 

Lemma 2.7. Suppose that P is a <e-dense partial latin square containing < Sn 2 - 
many filled cells, and L is a latin square of the same dimension as P as generated 
by Lemma |2.1[ Then we can permute the rows, columns, and symbols of P in such 
a way to ensure that there are at least 5n- (I §1 — 2) cells in P with associated 2x2 
trades, such that at most 

39 

81en+ Yoo n + 97e2n2 
cells in these 2x2 trades are claimed by multiple trades. 

Proof. Take a < e-dense partial latin square P, and generate a latin square L of the 
same dimension using Lemma |2.1| Fix a < e-dense partial latin square P containing 
8n 2 many filled cells, and create a latin square L of the same dimension as P using 



Lemma 2.1 Uniformly /randomly choose three permutations of {1, ... n}, and use 
these three permutations to permute respectively the rows , columns, and symbols of 
P. Notice that it doesn't matter in which order we apply these three permutations: 
i.e. first permuting the rows and then the columns of a latin square is equivalent to 
instead permuting the columns first and then the rows. 

We start by showing that the expected number of cells in P that have corre- 
sponding 2x2 trades in P, possibly overlapping, is at least Sn ■ f [Sj — 2) . To do 
this, assume that we have already permuted the rows and columns of P, and are 
about to permute the symbols. Fix any filled cell in P; the probability that our 
random permutation of P's symbols places a symbol in this cell that has a 2 x 2 
trade is > ^~"^~ 2 ■ Summing this over all filled cells in P gives us that the expected 
number of cells with corresponding 2x2 trades is at least Sn ■ ([^J — 2). 

We now calculate the expected number of these 2x2 squares that "conflict" 
with each other: we do this by considering the ten possible ways in which a cell can 
be claimed by two different 2x2 squares, and show that for each of these specific 
types of conflict the expected number of conflicts is relatively small. 

We start by first observing that it is impossible for a cell in I to be a row- 
dependent cell for two different cells in P. This is because if the cell (ri,C2) con- 
taining Si in L was row-dependent for two different cells in P, they would have to 
both be in the same row r\ and contain the same symbol s\, by construction. Sim- 
ilarly, this argument shows that it is impossible for a cell to be column-dependent 
for two different cells in P. As well, it is impossible for a cell in L to be an overlap 
cell for two different cells in P. 
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The rest of the cases are possible. We start with the easiest case to calculate 
expected values for: counting the expected number of cells in L that arc simultane- 
ously row-dependent and overlap cells for different 2x2 trades. To do this, again 
assume that we've already permuted the rows and symbols in P. Fix a row r. In 
this row in P, there are currently < en filled cells by definition. Consequently, 
there are < en overlap cells in row r in L, and < en row-dependent cells in row r in 
L. Notice that permuting the columns of P changes the location of these overlap 
cells, but does not change the location of these row-dependent cells, as the only 
information used to determine the row-dependent cell corresponding to a filled cell 
in P is its row and symbol. 

For any individual filled cell in row r in P, the probability that our permutation 
lands it on one of these fixed cells is < — = e. Therefore, the expected number of 
cells in row r that arc both overlap cells and row-dependent cells is < en ■ e = e 2 n, 
and the expected total number of such cells in our entire latin square is < e 2 n 2 . 

An identical argument counts the (column + overlap) cells. As well, it is not 
much harder to count the number of (row + column) cells. To do this: pick any 
symbol s in L, and look at the n cells in L that contain this symbol. Call this 
set of n cells the s-set in L, for shorthand. Notice that a cell in P induces a 
row-dependent cell in our s-set if and only if it contains a s; similarly, it induces a 
column-dependent cell if and only if it contains a s. Therefore, the number of row- 
dependent cells in this s-set is always < en, as is the number of column-dependent 
cells. 

Suppose that we've already permuted the rows and symbols of P, and are about 
to permute the columns of P. As noted above, doing this does not change which 
cells in our s-set are row-dependent; however, it does change which cells are column- 
dependent. In particular, a column induces a cell in our s-set if and only if it 
contains a s. There are < en such columns. Therefore, the probability that we 
place any one of these columns in one of the en locations where it will induce a cell 
that's both row and column-dependent is < e. Thus, the total expected number of 
such collisions over our entire latin square is < e 2 n 2 . 

Counting (symbol + overlap), or (symbol + symbol) cells is harder. In particular, 
unlike our calculations above, there is no row, column or symbol in which we are 
guaranteed to have < en many symbol-dependent cells show up. However, we would 
expect that these situations are relatively rare, and that under "most" permutations 
of P's rows, columns, and symbols, these situations do not occur. We do this as 
follows. Suppose that we have already permuted the rows of P, and are about to 
permute the columns. 

(1) First, we will take any symbol s in L, and show that the expected number of 
overlap cells in L that contain s after permuting columns is < en. Therefore, 
because the only way that one of these cells is symbol-dependent is if a 
corresponding cell containing s in L is an overlap cell, the expected number 
of symbol-dependent cells corresponding to symbol s is also < en. 

(2) From there, we will also calculate the variance of this expected value, and 
show that it is < en. 

(3) We will then use this information in a similar argument to the ones given 
to enumerate (row + overlap) cells to enumerate the expected number of 
(symbol + overlap) and (symbol + symbol) cells. 
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Pick any symbol s in L, and look at the corresponding s-set in L. Fix any row 
r of P. When we permute the columns of P, the probability that one of the filled 
cells in P lands on the one cell containing s in row r of L is < e. Therefore, if we 
sum these probabilities over all n rows of L, the expected number of overlap cells 
in our s-set is < era, and thus the expected number of symbol-dependent cells in 
our s-set is also < era. 

We now calculate the variance of the number of overlap cells in our s-set. Let 
X s denote the number of overlap cells in our s-set after our column permutation, 
and Xi denote the event that the cell containing symbol s in row i is an overlap 
cell after this permutation. As noted before, Pr(xj) = e for any j. Furthermore, 
for any i ^ j, we have that Pr(xj and Xi) < e ■ "ttt- To see this, let a be the 
column incident with (row i, symbol s) in L, and b be the column incident with 
(row j, symbol s) in L. If we are permuting P's columns, have just decided which 
column a maps to, and are now deciding which column b maps to, there are at 
most en choices for b that cause \j to hold, out of n — 1 total options. Therefore, 
Pr(xi and Xj) — P r (x») ' P r (Xj given Xi) < e 1 ^rj~i an< ^ thus ^ ne variance is 



Suppose now that we've taken a < e-dense partial latin square and randomly 
permuted its rows and columns. Fix a symbol s in L, and look at the corresponding 
s-set in L. After we permute the symbols of P, what is the expected number of 
cells in our s-set that are both overlap cells and symbol-dependent cells, or symbol- 
dependent cells in two different ways? 

Well: suppose that j > X s > for some j. Notice that permuting the 

symbols of P doesn't change which of the cells in our s-set are overlap cells, while 
it does permute which cells are symbol-dependent. In particular, take any filled 
cell in P that corresponds to an overlap cell in our s-set . There are at most j 
out of the total n choices of symbol to place in this filled P-cell, that will cause 
its corresponding symbol-dependent cell in L to land on one of the j overlap cells 
in our s-sct. Therefore, the expected number of such cells over the entire s-set is 

< — 

— 3 

As well, notice that each choice of symbol creates precisely one symbol-dependent 
cell to avoid. Therefore, the probability that a given choice of symbol creates a cell 
that is symbol-dependent in two different ways is bounded above by the probability 
that choosing the last symbol for our cells in P creates such a cell, which is just 

< . = J-j- . So the expected number of such cells is < . (In the event 
that j = 1, we can do better and bound this above by n, as there are at most n 
symbols in X s .) 




=en. 
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Using similar logic, note that if X s < ken, for some k, then the expected number 
of (overlap+symbol-dependent) cells is < k 2 e 2 n and the expected number of doubly- 

,2 2 

symbol-dependent cells is < \ \ ■ 

To bound the likelihood that X s exceeds y, we can use Chebyshev's inequality 
to get the following: 

Pi j X s > j] < 1 



en 

Therefore, if we want to count the total number of these cells, we can simply split 
X s into cases: either X s is between ™ and y^j for some j in {1, . . . \^\ — 1} ; or 
X s < fcen. If we use the Chebyshev-derived inequality we discussed earlier, we can 
bound the probability that X s lands into any of these cases, and therefore bound 
the expected number of (overlap+symbol) and (symbol+symbol) cells in our s-set 
by the following: 

k 2 e 2 n + — 

1 — ke 



<81e 




<81e + > - ^ + fcVn + 



<81e + — , xo + k 2 e 2 n + 



1 - ke 

k 2 e 2 n 



4{k-l-ke) 2 1-h 



Some optimization suggests that for all of the values of e, n for which this result will 
not be superseded by Theorem \2A\ we should set k = 8. Doing this, simplifying, 
and summing this expectation over all symbols s in L gives us the following upper 
bound on the expected number of (overlap+symbol) + (symbol+symbol) cells: 

205 „„ 2 2 

81en + Tooo n 

(To perform the simplification above, we assumed that e < y^Q- In the event that 
e > j^g, the bound on the total number of nonoverlapping cells in our lemma's 
statement is vacuous; so we are free to ignore this case.) 

The number of (row + symbol) cells is counted in a similar fashion. Again, 
suppose that we have already permuted the rows of P. Fix a symbol s. If we 
permute the columns of P at random, our work above has shown that the expected 
number of symbol-dependent cells that land in our s-set is en, and that the variance 
is en. 

Suppose for the moment that the number of symbol-dependent cells in our s-set 
is < len, for some I > 1. Now, randomly permute the symbols of P. Specifically, 
choose our random permutation as follows: first select the symbol that gets mapped 
to s, and then choose where all of the remaining symbols that occur in the cells in 
P that induce symbol-dependent cells in our s-set. 
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In choosing the symbol t that maps to s, we simultaneously fix all of the cells 
in our s-set that are row-dependent, as well as all of the cells that are symbol- 
dependent that originally contained a t. In doing this, it is possible that through 
"poor luck" all of the symbol-dependent cells that used to contain a t land on 
the row-dependent cells. Thereafter, when we choose where to send the remaining 
symbols that induce symbol-dependent cells in our s-set, there are at most en row- 
dependent cells that we could induce a symbol-dependent cell in. Therefore, given 
any cell inducing a symbol-dependent cell in our s-set, the probability that mapping 
the symbol contained in that cell makes this symbol-dependent cell land on a row- 
dependent cell is < z^zT ■ Accordingly, the expected number of cells that are both 

2 2 

row-dependent and symbol-dependent generated by these choices is < l^z\- 

So, we need to just deal with the "poor luck" case above. We do this as follows: 
look at the cells in P that induce symbol-dependent cells in our s-set. Call a symbol 
"bad" if it occurs more than dlen times in this collection, for some constant d that 
we'll decide later. At most ^ symbols are bad. Therefore, in our first step, when 

we select the symbol t that maps to s, we have at most a chance of picking a 

bad symbol for t, and at least a — - chance of not doing so. In the case where 
we choose a "bad" symbol, we can simply assume that all en row-dependent cells 
are also symbol-dependent. In the case where we have not done so, we can assume 
that at most the dlen resulting symbol-dependent cells are also row-dependent. 

Consequently, the expected number of cells in our s-set that are both row- 
dependent and symbol-dependent is at most 

1 1 \ , e 2 n 2 „ 

Some simple calculus suggests that setting d — ~rzj, J = 10 is roughly optimal for 
all of the cases of n, e where this lemma is useful. If we do this, sum over all n s-sets 
in L, and use the simplifying observation that en > 1 for any nontrivial choices of 
P, we get the following upper bound on the expected number of (row+symbol) cells 
in L: 



n 



17e z n 



2„„2 



81 

The same argument counts the number of (column + symbol) cells. 

We have therefore bounded the expected number of all possible conflicts that a 
pair of 2 x 2 trades can have with each other. If we sum these bounds, we have 
that the expected total number of cells involved in multiple 2x2 trades is at most 
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81en+ — n + 166e 2 n 2 . 



□ 



Our final theorem, roughly speaking, is the claim that Lemma |2.7| does allow us 
to roughly improve the bounds of Theorem |2.4| by a factor of 2: 



Theorem 2.8. Any <e-dense partial latin square P is completable, for e,S,n such 
that 



12<n~ 12ml 366 + — + — + — + 10956e 2 - 12en. 
n n 100 • n 
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Proof. Take a <e-dense nxn partial latin square containing < 5n 2 many cells, and 
use Lemma|2.1|to construct a n x n latin square L of the same dimension. Using 



Lemma 2.7 select a permutation of P's rows/columns/symbols such that there are 
at least 



(t) = Sn ( 



81en — Yqq u ~ 166e 2 n 2 



distinct cells in P with associated nonoverlapping 2x2 trades. Apply these per- 
mutations to P; if this permuted partial latin square can be completed to some L, 
then reversing these permutations on both P and L will yield a completion of our 
original square. 

Perform all of the nonoverlapping 2x2 trades guaranteed by Lemma [2.7| this 
disturbs 3(|) cells that are not places where P and L agree. Now, cell by cell, select 
a filled cell (ri,ci) of P at which P(ri,ci) ^ L(ri,Ci), and apply Lemma 2.3 to 
find a trade that disturbs at most 69 other cells and that causes L and P to agree 
at this cell. Doing this for every remaining cell at which P, L disagree disturbs at 
most 69(<5n 2 - (J)) cells. 

In total, then, we have disturbed at most 

,2 for.? 198(5 5346e 1518 ,„n„ 2\ 2 

kn 2 := 365 H 1 h — h 10956e 2 ) n 1 

\ n n 100 • n 

cells in total by the end of our proof. 



Therefore, by Lemma 2.3 to decide whether we can perform all of these trades, 
it suffices to find constraints on 6, e, n such that we can consistently perform Lemma 
2.3| on L until it is a completion of P. In other words, it suffices to choose 6, e, n 
such that the inequality 



12 < n - 12W365 + M + — + — + 10956^ - Uen. 
n n 100 • n 



holds. □ 

For somewhat small values of e, 8 and large-ish values of n, the above formula is 
effectively 

12 < n - 12n\/36J - 12en, 



which is an improvement on the bounds of Theorem |2.4| by about a factor of 2. We 
finally note that this improvement is particularly noticeable when e = 5: 

Corollary 2.9. All g^-dense partial Latin squares are completable, for n > 25 q Q0 . 
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